Figure 1 



Nucleotide sequence of the Prunus amygdalus HNL5 gene 
obtained by PCR amplification 

The start codon (ATG) and stop codon of the 
open reading frame are printed in bold type, and the 
nucleotides in the intron regions are indicated in 
lower case letters. The peripheral sequences which have 
been attached via the PCR primers and which are not 
part of the HNL5 gene are underlined. The splice sites 
of the introns were identified with the aid of the 
consensus sequence "GT....AG".

1 GGAATTCACA ATATGGAGAA ATCAACAATG TCAGTTATAC TATTTGTGTT 

51 GCATCTTCTT GTTCTTCATC TTCAGTATTC AGAGGTTCAC TCGCTTGCCA 

101 ATACTTCTGC TCATGgtaaa ittccatctt cagtattcat ttaacagcaa

151 aatgtgtaga ttfataatta agaaaactga cacsagtagt gcaagaaaca 

201 agctaattta gatgcalgtt gaaaaaaatc tttcatctct tcacatatat 

251 WgcagATT TTAGCTACTT GAAGTTTGTG TACAACGCCA CTGATACAAG 

301 CTCGGAAGGA TCATATGACT ACATTGTAAT CGGTGGAGGA ACATCAGGGT 

351 GTCCATTGGC AGCAACTTTA TCAGAAAAAT ACAAGGTGCT TCTTCTAGAA 

401 AGAGGCACTA TTGCTACAGA ATACCCGAAC ACGTTGACTG CAGATGGGTT 

451 TGCATATAAT CTGCAGCAAG AAGATGATGG AAAGACGCCA GTTGAAAGGT 

501 TCGTGTCCGA AGATGGCATT GATAATGTGC GAGCCAGGAT CCTCGGTGGC 

551 ACGACCATAA TCAATGCAGG CGTCTACGCC AGAGCTAACA TTTCATTCTA 



601 TAGTCAAACA GGAATTGAAT GGGACCTGGA TTTGGTCAAT AAGACATATG

651 AGTGGGTTGA AGACGCCATT GTGGTCAAGC CAAATAATCA ATCTTGGCAA

701 TCTGTTATAG GAGAGGGATT CTTGGAGGCG GGTATTCTTC CAGACAATGG 

751 ATTTAGTTTG GATCACGAAG CAGGAACTAG ACTCACCGGC TCAACTTTTG

801 ACAATAATGG AACGCGACAT GCGGCTGATG AACTGCTTAA TAAAGGAGAC 

851 CCTAATAACT TGCTAGTTGC AGTTCAGGCC TCAGTAGAGA AGATCCTCTT 

901 CTCTTCCAAT ACATCAAgta tgttgcatca gtgatattta alggtagctc 

951 ctagtttgtc atgctgcact cgaaaattat tattttatca ttttaaaata 

1001 ctaacagaat agtgtgaagt ctcatatttc ccttccatat ttcccaaatt

1051 tccaiaaaca aaactlccca atlctccttc gtttagtttg acaataatta 

1101 taagctattc tctaatgcag ATTTGTCAGC TATTGGAGTC ATATATACGG

1151 ATTCTGATGG AAACTCTCAT CAGGCATTTG TACGCGGTAA CGGAGAAGTT

1201 ATTGTTAGTG CAGGGACAAT CGGAACGCCT CAGCTTCTAC TACTTAGTGG 

1251 CGTTGGACCA GAGTCTTACC TATCTTCTCT CAACATCACA GTTGTTCAGC 

1301 CGAATCCTTA TGTTGGGCAG TTTGTGTATG ACAATCCTCG TAATTTCATT

1351 AATATTTTGC CCCCAAATCC AATTGAAGCC TCTGTTGTAA CTGTTTTAGG

1401 CATTAGAAGT GATTATTATC AAGTTTCTCT GTCAAGCTTG CCATTTTCCA

1451 CTCCACCCTT TAGTCTTTTT CCTACAACAT CTTACCCCCT CCCAAATTCG 

1501 ACTTTTGCTC ATATTGTTAG CCAAGTTCCA GGACCATTGT CTCATGGTTC

1551 TGTCACGCTA AATTCATCAT CTGACGTGAG AATCGCTCCA AATATTAAAT 

1601 TCAATTACTA TTCAAATTCC ACAGACCTTG CTAATTGTGT TAGCGGCATG

1651 AAGAAGCTTG GTGACTTATT AAGGACAAAG GCATTAGAAC CATATAAAGC

1701 TCGAGATGTG CTGGGAATTG ACGGTTTCAA TTATTTGGGA GTACCTTTGC

1751 CAGAGAACCA AACAGATGAT GCATCCTTCG AAACATTTTG TCTAGATAAT

1801 GTAGCTTCAT ACTGGCATTA CCACGGTGGA AGCCTTGTTG GGAAAGTGCT

1851 TGATGACAGT TTCCGTGTTA TGGGGATCAA AGCATTACGC GTTGTTGATG

1901 CCTCCACTTT CCCTTACGAA CCAAACAGCC ATCCTCAGGG CTTCTATCTG

1951 ATGTTAGGAA Ggtatgtgat gcacacttcc aaccactaga gattctcaat

2001 attttgttgt tgttgtaatg aactctctgc cgcattgctc ttttttatta 

2051 atccttaaaa ttttgtgttt tgcgcagGTA TGTGGGCCTT CAAATCCTGC 

2101 AAGAAAGGTC AATCCGGTTG GAGGCTATTC ATAATATTCA AGAGTCCATG

2151 TGAAGAATTC CG
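
The case convention described in the caption of Figure 1 (exons in upper case, introns in lower case) allows a simple mechanical check of the "GT....AG" rule. The following Python sketch is not part of the original disclosure. It splits a mixed-case sequence into exon and intron runs and tests whether each intron begins with GT and ends with AG. The function names and the short demo sequence are hypothetical and serve only as an illustration.

```python
import re


def split_exons_introns(seq):
    """Split a mixed-case sequence into exon (upper) and intron (lower) runs.

    Position numbers and spaces, as used in the figure layout, are discarded.
    """
    cleaned = re.sub(r"[^A-Za-z]", "", seq)
    exons = re.findall(r"[A-Z]+", cleaned)
    introns = re.findall(r"[a-z]+", cleaned)
    return exons, introns


def follows_gt_ag_rule(intron):
    """True if the intron starts with GT and ends with AG."""
    intron = intron.lower()
    return intron.startswith("gt") and intron.endswith("ag")


# Hypothetical demo sequence in the layout of Figure 1 (not taken from the figure).
demo = "1 ATGGCTCATGgtaaatttcc 21 atctttgcagATTTTAGCTA"
exons, introns = split_exons_introns(demo)
spliced = "".join(exons)

for intron in introns:
    print(intron, follows_gt_ag_rule(intron))
print(spliced)
```

Because the check relies only on letter case, any character misread during text extraction inside an intron will not affect the exon sequence that is joined together.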
Figure 2

Figure 3 :

Amino acid sequence of the Prunus amygdalus 
hydroxynitrile lyase (HNL5), derived from the
nucleotide sequence of the HNL5 gene. The signal 
sequence determined from sequence analysis is printed 
in bold type and the postulated processing site is 
indicated by an arrow. Possible glycosylation sites 
(PROSITE patterns) are underlined. 

MEKSTMSVILFVLHLLVLHLQYSEVHSLA NTSA HDFSYLKFV YNATD TSSEGSYDYI

VIGGGTSGCPLAATLSEKYKVLLLERGTIATEYPNTLTADGFAYNLQQQDDGKTPVE

RFVSEDGIDNVRARILGGTTIINAGVYARA NISF YSQTGIEWDLDL VNKTY EWVEDAI

VVKPNNQSWQSVIGEGFLEAGILPDNGFSLDHEAGTRLTGSTFDNNGTRHAADELL

NKGDPNNLLVAVQASVEKILFSSNTSNLSAIGVIYTDSDGNSHQAFVRGNGEVIVSA

GTIGTPQLLLLSGVGPESYLSSLNITVVQPNPYVGQFVYDNPRNFINILPPNPIEASVV

TVLGIRSDYYQVSLSSLPFSTPPFSLFPTTSYPLP NSTFA HIVSQVPGPLSHGSVTLN

SSSDVRIAPNIKFNYYSNSTDLANCVSGMKKLGDLLRTKALEPYKARDVLGIDGFNY

LGVPLPENQTDDASFETFCLDNVASYWHYHGGSLVGKVLDDSFRVMGIKALRVVD

ASTFPYEPNSHPQGFYLMLGRYVGLQILQERSIRLEAIHNIQESM
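
The caption of Figure 3 refers to possible glycosylation sites found with PROSITE patterns but does not name the pattern used. As a minimal sketch, assuming the standard PROSITE N-glycosylation pattern PS00001 (N-{P}-[ST]-{P}) is meant, such sites can be located with a regular expression. The function name and the demo fragment handling are illustrative assumptions, not the method of the original disclosure.

```python
import re

# PROSITE PS00001, N-{P}-[ST]-{P}: Asn, any residue except Pro, Ser or Thr,
# any residue except Pro. The lookahead lets overlapping sites be reported.
N_GLYC = re.compile(r"(?=(N[^P][ST][^P]))")


def find_n_glycosylation_sites(protein):
    """Return (1-based position, matched motif) for each candidate site."""
    protein = re.sub(r"\s+", "", protein.upper())
    return [(m.start() + 1, m.group(1)) for m in N_GLYC.finditer(protein)]


# Fragment taken from the first line of the Figure 3 sequence.
fragment = "HSLANTSAHDFSYLKFVYNATDTSS"
print(find_n_glycosylation_sites(fragment))
```

Positions are counted from the start of the string passed in, so for a fragment they are relative to the fragment, not to the full protein.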
Figure 4 :

Nucleic acid sequence of the DNA fragment coding for a
secretory hybrid protein (PamHNL5xGOX) with HNL
activity, consisting of sequences of the Prunus 
amygdalus HNL5 gene and the Aspergillus niger glucose 
oxidase gene. 

gaattcatcatgcagactctccttgtgagctcgcttgtggtctccctcgctgcggccctgcca- 

cactacatcaggagcaatggcattgaagcctacaacgccactgatacaagctcggaaggatca-

tatgactacattgtaatcggtggaggaacatcagggtgtccattggcagcaactttatcagaa-

aaatacaaggtgcttcttctagaaagaggcactattgctacagaatacccgaacacgtt-

gactgcagatgggtttgcatataatctgcagcaacaagatgatggaaagacgccagttga-

aaggttcgtgtccgaagatggcattgataatgtgcgagccaggatcctcggtggcacgacca- 

taatcaatgcaggcgtctacgccagagctaacatttcattctatagtcaaacaggaatt- 

gaatgggacctggatttggtcaataagacatatgagtgggttgaagacgccattgtggt- 

caagccaaataatcaatcttggcaatctgttataggagagggattcttggaggcggg-

tattcttccagacaatggatttagtttggatcacgaagcaggaactagactcaccggct-

caacttttgacaataatggaacgcgacatgcggctgatgaactgcttaataaaggagaccc- 

taataacttgctagttgcagttcaggcctcagtagagaagatcctcttctcttccaatacat-

caaatttgtcagctattggagtcatatatacggattctgatggaaactctcatcaggcatttg- 

tacgcggtaacggagaagttattgttagtgcagggacaatcggaacgcctcagcttctac-

tacttagtggcgttggaccagagtcttacctatcttctctcaacatcacagttgttcagcc-

gaatccttatgttgggcagtttgtgtatgacaatcctcgtaatttcattaatattttgcccc-

caaatccaattgaagcctctgttgtaactgttttaggcattagaagtgattattat- 

caagtttctctgtcaagcttgccattttccactccaccctttagtctttttcctacaacatct-

taccccctcccaaattcgacttttgctcatattgttagccaagttccaggaccattgtct- 

catggttctgtcacgctaaattcatcatctgacgtgagaatcgctccaaatattaaattcaat- 

tactattcaaattccacagaccttgctaattgtgttagcggcatgaagaagcttggtgact-

tattaaggacaaaggcattagaaccatataaagctcgagatgtgctgggaattgacggttt- 

caattatttgggagtacctttgccagagaaccaaacagatgatgcatccttcgaaa- 

cattttgtctagataatgtagcttcatactggcattaccacggtggaagccttgttggga-

aagtgcttgatgacagtttccgtgttatggggatcaaagcattacgcgttgttgatgcctc-

cactttcccttacgaaccaaacagccatcctcagggcttctatctgatgttaggaagg-

tatgtgggccttcaaatcctgcaagaaaggtcaatgcagtgagcggccgcatgcgaattc 
Figure 5 : Amino acid sequence of the hybrid protein
PamHNL5xGOX, derived from the nucleic acid sequence
(figure 4).

MQTLLVSSLVVSLAAALPHYIRSNGIEAYNATDTSSEGSYDYIVIGGGTSGCPLAATL
SEKYKVLLLERGTIATEYPNTLTADGFAYNLQQQDDGKTPVERFVSEDGIDNVRARI
LGGTTIINAGVYARANISFYSQTGIEWDLDLVNKTYEWVEDAIVVKPNNQSWQSVIG
EGFLEAGILPDNGFSLDHEAGTRLTGSTFDNNGTRHAADELLNKGDPNNLLVAVQA
SVEKILFSSNTSNLSAIGVIYTDSDGNSHQAFVRGNGEVIVSAGTIGTPQLLLLSGVG
PESYLSSLNITVVQPNPYVGQFVYDNPRNFINILPPNPIEASVVTVLGIRSDYYQVSLS
SLPFSTPPFSLFPTTSYPLPNSTFAHIVSQVPGPLSHGSVTLNSSSDVRIAPNIKFNY
YSNSTDLANCVSGMKKLGDLLRTKALEPYKARDVLGIDGFNYLGVPLPENQTDDAS
FETFCLDNVASYWHYHGGSLVGKVLDDSFRVMGIKALRVVDASTFPYEPNSHPQG
FYLMLGRYVGLQILQERSMQ
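
Figure 5 is stated to be derived from the nucleic acid sequence of Figure 4. The following Python sketch shows how such a derivation can be reproduced using the standard genetic code. It is an illustration only: the function name is hypothetical, the reading frame is assumed to start at the first ATG in the input, and line-wrap hyphens and whitespace are assumed to carry no sequence information.

```python
import re
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_from_first_atg(dna):
    """Translate from the first ATG up to the first stop codon.

    Codons containing unexpected characters are reported as 'X' rather than
    being skipped, so that the reading frame is never silently shifted.
    """
    dna = re.sub(r"[\s\-]", "", dna).upper()
    start = dna.find("ATG")
    if start < 0:
        return ""
    protein = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 36 nucleotides of the Figure 4 sequence.
print(translate_from_first_atg("gaattcatcatgcagactctccttgtgagctcgctt"))
```

The result for this fragment, MQTLLVSSL, agrees with the first nine residues of the Figure 5 sequence.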
Figure 6: Comparison of the amino acid sequences of
Prunus amygdalus HNL5 and of the hybrid protein
PamHNL5xGOX. Sequence parts of Aspergillus niger
glucose oxidase are underlined. Sequence regions having 
no significant homology between the two proteins are 
printed in italics, and the signal peptides are printed 
in bold type. 

PamHNL5Gox   1 mqtllvsslv vslaaalphy irsngiea-- ---------- --YNATDTSS
PamHNL5      1 mekstmsvil fvlhllvlhl qysevhslan tsahdfsylk fVYNATDTSS

PamHNL5Gox  37 EGSYDYIVIG GGTSGCPLAA TLSEKYKVLL LERGTIATEY PNTLTADGFA
PamHNL5     51 EGSYDYIVIG GGTSGCPLAA TLSEKYKVLL LERGTIATEY PNTLTADGFA

PamHNL5Gox  87 YNLQQQDDGK TPVERFVSED GIDNVRARIL GGTTIINAGV YARANISFYS
PamHNL5    101 YNLQQQDDGK TPVERFVSED GIDNVRARIL GGTTIINAGV YARANISFYS

PamHNL5Gox 137 QTGIEWDLDL VNKTYEWVED AIVVKPNNQS WQSVIGEGFL EAGILPDNGF
PamHNL5    151 QTGIEWDLDL VNKTYEWVED AIVVKPNNQS WQSVIGEGFL EAGILPDNGF

PamHNL5Gox 187 SLDHEAGTRL TGSTFDNNGT RHAADELLNK GDPNNLLVAV QASVEKILFS
PamHNL5    201 SLDHEAGTRL TGSTFDNNGT RHAADELLNK GDPNNLLVAV QASVEKILFS

PamHNL5Gox 237 SNTSNLSAIG VIYTDSDGNS HQAFVRGNGE VIVSAGTIGT PQLLLLSGVG
PamHNL5    251 SNTSNLSAIG VIYTDSDGNS HQAFVRGNGE VIVSAGTIGT PQLLLLSGVG

PamHNL5Gox 287 PESYLSSLNI TVVQPNPYVG QFVYDNPRNF INILPPNPIE ASVVTVLGIR
PamHNL5    301 PESYLSSLNI TVVQPNPYVG QFVYDNPRNF INILPPNPIE ASVVTVLGIR

PamHNL5Gox 337 SDYYQVSLSS LPFSTPPFSL FPTTSYPLPN STFAHIVSQV PGPLSHGSVT
PamHNL5    351 SDYYQVSLSS LPFSTPPFSL FPTTSYPLPN STFAHIVSQV PGPLSHGSVT

PamHNL5Gox 387 LNSSSDVRIA PNIKFNYYSN STDLANCVSG MKKLGDLLRT KALEPYKARD
PamHNL5    401 LNSSSDVRIA PNIKFNYYSN STDLANCVSG MKKLGDLLRT KALEPYKARD

PamHNL5Gox 437 VLGIDGFNYL GVPLPENQTD DASFETFCLD NVASYWHYHG GSLVGKVLDD
PamHNL5    451 VLGIDGFNYL GVPLPENQTD DASFETFCLD NVASYWHYHG GSLVGKVLDD

PamHNL5Gox 487 SFRVMGIKAL RVVDASTFPY EPNSHPQGFY LMLGRYVGLQ ILQERSmq--
PamHNL5    501 SFRVMGIKAL RVVDASTFPY EPNSHPQGFY LMLGRYVGLQ ILQERSirle

PamHNL5Gox 535

PamHNL5    551 aihniqesm
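
A comparison of the kind shown in Figure 6 can be summarised numerically. The following Python sketch computes the percentage of identical residues over the aligned, gap-free columns of two alignment rows. It is an illustration under stated assumptions: the function name is hypothetical, "-" is taken as the gap symbol, and upper and lower case letters are treated as the same residue. No identity value is given in the original figure.

```python
def alignment_identity(row_a, row_b, gap="-"):
    """Percent identity over columns where neither row has a gap."""
    a = row_a.replace(" ", "").upper()
    b = row_b.replace(" ", "").upper()
    if len(a) != len(b):
        raise ValueError("alignment rows must have the same number of columns")
    columns = [(x, y) for x, y in zip(a, b) if x != gap and y != gap]
    if not columns:
        return 0.0
    matches = sum(x == y for x, y in columns)
    return 100.0 * matches / len(columns)


# Two blocks that are identical in both rows of Figure 6.
print(alignment_identity("EGSYDYIVIG GGTSGCPLAA", "EGSYDYIVIG GGTSGCPLAA"))

# Hypothetical rows with a gap, for illustration only.
print(alignment_identity("MQTL--YN", "MEKSTMYN"))
```

Columns containing a gap are excluded from the denominator here; counting them as mismatches instead is an equally common convention and would give lower values.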
Figure 7: Analysis of HNL preparations by SDS-PAGE.
Details are described in example 11.

Figure 8 :

Nucleotide sequence of the Prunus amygdalus HNL1 gene 
obtained by PCR amplification. 

ATGGAGAAATCAACAATGTCAGCTATACTGTTGGTGTTATACATTTTTGTCCTCC 
ATCTTCAATATTCTGAGGTCCACT 

CGCTTGCCACGACTTCTGATCATGgtsaatcacttcaaccgtaattcaaaacaccaaaaagg- 
caatcaaaaagaaaacg 

gaaaaaagtgtaagaaaagcagaiatagacgcctgcatagatgcatgtgctatatacttttaaaaactcttcgtctctt 

gagattttgcagATTTTAGCTACCTGAGCTTTGCATACGACGCCACTGATCTA- 

GAGTTGGAAGGATCATATGACTACGT 

TATAGTTGGCGGAGGAACATCAGGGTGTCCATTGGCAGCAACTTTATCAGAAAA 
ATACAAGGTGCTCGTTCTCGAAAGG 

GGCAGTCTTCCGACAGCATATCCCAACGTCTTGACTGCAGATGGGTTTGTATAT 
AATCTCCAGCAAGAAGATGATGGAA

AGACACCGGTCGAAAGGTTCGTGTCCGAAGATGGTATTGATAATGTACGGGGC 
AGGGTGCTCGGTGGCACAAGCATTAT 

CAATGCCGGTGTCTACGCCAGAGCTAACACCTCAATCTATAGTGCATCAGGAGT 
TGATTGGGACATGGATTTGGTTAAT

CAGACATATGAGTGGGTTGAAGACACTATTGTGTACAAGCCAAATTCTCAATCTT 
GGCAGTCTGTTACAAAAACTGCAT 

TCTTGGAGGCTGGTGTTCATCCAAACCATGGATTTAGTTTAGATCATGAAGAAG
GAACTAGAATTACCGGCTCAACTTT 

TGACAACAAGGGAACGAGACATGCAGCTGATGAACTTCTTAATAAAGGAAACTC 
TAACAACTTGCGAGTTGGAGTTCAT

GCCTCAGTAGAGAAGATCATCTTCTCCAATGCACCAGgtatgttgcatcatgcactccaa- 
aattaatattttgtcattt 

iaaaacactagcaggagccaaggtctggaagtacgaataaaatttcattattttccttggatttgtttgataatgatta 
taagcttttctgtaatgtagGTTTGACAGCTACAGGAGTCATATATAGGGATTCTAATG- 
GAACGCCTCACCAAGCATTT 

GTACGCAGTAAGGGAGAAGTTATCGTGAGTGCAGGGACAATTGGGACCCCTCA 
ACTTCTACTACTTAGCGGTGTTGGGC 
CAGAGTCTTACCTATCATCTCTAAATATTCCAGTTGTTCTTTCCCATCCTTACGTC
GGACAGTTTCTGCATGACAATCC 

TCGTAATTTCATTAACATTTTGCCCCCAAATCCAATTGAACCCACAATTGTAACTG 
YTCTAGGCATTTCAAACGATTTC 

TACCAATGTTCTTTCTCGAGCTTGCCATTTACAACTCCACCCTTCGGTTTTTTCC 
CTAGTGCATCTTATCCCCTGCCAA 

ATTCGACTTTTGCTCACTTTGCTAGCAAAGTGGCAGGACCTTTATCATATGGTTC 
TCTCACACTGAAATCATCCTCCAA 

TGTGAGAGTCAGTCCAAATGTCAAATTTAATTACTATTCAAATCTGACAGATCTTT 
CTCATTGTGTTAGCGGCATGAAG 

AAGATTGGTGAACTCTTGAGCACAGACGCATTAAAACCATATAAAGTTGAAGATT 
TGCCGGGTGTAGAAGGTTTTAATA 

TTTTGGGAATCCCTTTGCCAAAGGACCAAACAGATGATGCAGCCTTCGAAACAT 
TTTGCCGAGAATCAGTAGCCTCATA

TTGGCACTACCACGGTGGATGCCTTGTTGGAAAGGTGCTTGATGGTGATTTCCG 
TGTTACAGGGATCAATGCATTACGC 

GTTGTTGATGGCTCAACATTCCCTTACACACCAGCGAGCCACCCTCAGGGCTTC 
TATCTGATGTTAGGGAGgtatgtts 

caaattctcaafaaftatttggttgagtggcttgttfgtaatgaactctatgccataittcictttctcatcctticca 
tttttgigccatgggcagGTATGTGGGCATTAAAATTCTGCAAGAAAGATCAGCTTCA- 
GATCTAAAAATCTTGGATTCC 
CTCAAGTCAGCAGCATCCTTGGTTCTTTAAACT 



Figure 9 : 



Amino acid sequence of Prunus amygdalus hydroxynitrile
lyase (HNL1), derived from the nucleotide sequence of
the HNL1 gene.



MEKSTMSAILLVLYIFVLHLQYSEVHSLATTSDHDFSYLSFAYDATDLELEGSY
DYVIVGGGTSGCPLAATLSEKYKVLVLERGSLPTAYPNVLTADGFVYNLQQE
DDGKTPVERFVSEDGIDNVRGRVLGGTSIINAGVYARANTSIYSASGVDWDM
DLVNQTYEWVEDTIVYKPNSQSWQSVTKTAFLEAGVHPNHGFSLDHEEGTRI
TGSTFDNKGTRHAADELLNKGNSNNLRVGVHASVEKIIFSNAPGLTATGVIYR
DSNGTPHQAFVRSKGEVIVSAGTIGTPQLLLLSGVGPESYLSSLNIPVVLSHPY
VGQFLHDNPRNFINILPPNPIEPTIVTVLGISNDFYQCSFSSLPFTTPPFGFFPS
ASYPLPNSTFAHFASKVAGPLSYGSLTLKSSSNVRVSPNVKFNYYSNLTDLSHC
VSGMKKIGELLSTDALKPYKVEDLPGVEGFNILGIPLPKDQTDDAAFETFCR
ESVASYWHYHGGCLVGKVLDGDFRVTGINALRVVDGSTFPYTPASHPQGFYL
MLGRYVGIKILQERSASDLKILDSLKSAASLVL



